690 research outputs found

    Extensions of the External Validation for Checking Learned Model Interpretability and Generalizability.

    Full text link
    We discuss the validation of machine learning models, which is standard practice in determining model efficacy and generalizability. We argue that internal validation approaches, such as cross-validation and bootstrap, cannot guarantee the quality of a machine learning model due to potentially biased training data and the complexity of the validation procedure itself. For better evaluating the generalization ability of a learned model, we suggest leveraging on external data sources from elsewhere as validation datasets, namely external validation. Due to the lack of research attractions on external validation, especially a well-structured and comprehensive study, we discuss the necessity for external validation and propose two extensions of the external validation approach that may help reveal the true domain-relevant model from a candidate set. Moreover, we also suggest a procedure to check whether a set of validation datasets is valid and introduce statistical reference points for detecting external data problems

    Conservation gap analysis of dipterocarp hotspots in Sarawak using GIS, remote sensing and herbarium data

    Get PDF
    Dipterocarpaceae is the dominant tree family in the tropical rain forests of Southeast Asia. Borneo is the centre of diversity for the dipterocarps. Identification of hotspots is important for forest and biodiversity conservation efforts. Species Occurrence Models (SOMs) were generated for all 247 species of dipterocarps recorded in Sarawak using herbarium occurrence data and based on the best model selected. The species occurrence density map for each genus and category (endemic and non endemic) was generated by overlaying the SOMs of all species in each genus or category. The species occurrence density maps were analyzed with land cover map from Landsat 7-EMT+ images and protected forest areas for identifying hotspots for conservation in Sarawak. Overlaying the SOM maps revealed that areas in central Sarawak and the southwest region (northwest Borneo around Kuching) are the main hotspots of dipterocarp diversity in Sarawak while the coastal lowland areas in the lower Rejang and Baram River which are mainly peat swamp forest are poorer in species occurrence density. In terms of endemism, as with dipterocarp diversity, the mixed diptercarp forest of central Sarawak is also the most important hotspot. Gap analysis revealed that most protected forest areas are in southwest Sarawak (Bako, Kubah, Tanjung Datu and Gunung Gading National Parks) and in the northern part of Sarawak (Niah, Lambir Hills and Mt Mulu National Parks). This leaves the hotspot in the central part of Sarawak least protected. Protected areas only cover between 2 and 4% of the total areas for the different hotspots (75% species density) while majority of the hotspots that are still forested are outside the protected areas

    Estimating logged-over lowland rainforest aboveground biomass in Sabah, Malaysia using airborne LiDAR data

    Get PDF
    Unprecedented deforestation and forest degradation in recent decades have severely depleted the carbon storage in Borneo. Estimating aboveground biomass (AGB) with high accuracy is crucial to quantifying carbon stocks for Reducing Emissions from Deforestation and Forest Degradation-plus implementation (REDD+). Airborne Light Detection and Ranging (LiDAR) is a promising remote sensing technology that provides fine-scale forest structure variability data. This paper highlights the use of airborne LiDAR data for estimating the AGB of a logged-over tropical forest in Sabah, Malaysia. The LiDAR data was acquired using an Optech Orion C200 sensor onboard a fixed wing aircraft. The canopy height of each LiDAR point was calculated from the height difference between the first returns and the Digital Terrain Model (DTM) constructed from the ground points. Among the obtained LiDAR height metrics, the mean canopy height produced the strongest relationship with the observed AGB. This single-variable model had a root mean squared error (RMSE) of 80.02 t ha-1 or 22.31% of the mean AGB, which performed exceptionally when compared with recent tropical rainforest studies. Overall, airborne LiDAR did provide fine-scale canopy height measurements for accurately and reliably estimating the AGB in a logged-over forest in Sabah, thus supporting the state's effort in realizing the REDD+ mechanism

    Epidemiological analysis of typhoid fever in Kelantan from a retrieved registry

    Get PDF
    Aim: Despite the endemicity of typhoid in Kelantan, epidemiological data showing typhoid association to age, sex,ethnicity and district of patients is limited. This retrospective study investigated the statistical association of thesevariables from a retrieved registry.Methodology and results: Cross-tabulation using SPSS was used to analyze 1394 cases of confirmed typhoid patientsadmitted to various hospitals in Kelantan state over a six-year period. Fourteen age groups with a five-year rangeinterval were generated. There was a significant association between typhoid infection and sex of subjects, wherebyfemales were generally more susceptible than males. Ethnicity and district of typhoid patients did not show significantassociation.Conclusion, significance and impact of study: The observation of an increased number of typhoid cases with a malepredominance in the age group 5-14 and female predominance in the 20-60 age group calls for improved hygiene,continued public health education, together with better laboratory diagnostics to identify carriers, are some measures tocontrol this disease

    Quantum Physics and Human Language

    Get PDF
    Human languages employ constructions that tacitly assume specific properties of the limited range of phenomena they evolved to describe. These assumed properties are true features of that limited context, but may not be general or precise properties of all the physical situations allowed by fundamental physics. In brief, human languages contain `excess baggage' that must be qualified, discarded, or otherwise reformed to give a clear account in the context of fundamental physics of even the everyday phenomena that the languages evolved to describe. The surest route to clarity is to express the constructions of human languages in the language of fundamental physical theory, not the other way around. These ideas are illustrated by an analysis of the verb `to happen' and the word `reality' in special relativity and the modern quantum mechanics of closed systems.Comment: Contribution to the festschrift for G.C. Ghirardi on his 70th Birthday, minor correction
    corecore